Overview
Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 239671 |
| Missing cells | 285626 |
| Missing cells (%) | 4.6% |
| Duplicate rows | 697 |
| Duplicate rows (%) | 0.3% |
| Total size in memory | 47.5 MiB |
| Average record size in memory | 208.0 B |
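The percentage rows in this table can be re-derived from the raw counts. The sketch below assumes that "Missing cells (%)" is computed over all cells (variables × observations) and that the record size is total memory divided by the number of observations. Because the 47.5 MiB input is already rounded, the record size only approximately matches the reported 208.0 B.

```python
# Figures copied from the Dataset statistics table.
n_variables = 26
n_observations = 239_671
missing_cells = 285_626
duplicate_rows = 697
total_memory_mib = 47.5  # already rounded in the report

# Assumption: the missing-cell share is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# 1 MiB = 1024**2 bytes.
avg_record_bytes = total_memory_mib * 1024**2 / n_observations

print(f"Missing cells:   {missing_pct:.1f}%")
print(f"Duplicate rows:  {duplicate_pct:.1f}%")
print(f"Avg record size: {avg_record_bytes:.1f} B (approximate)")
```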
Variable types
| Numeric | 10 |
|---|---|
| Text | 8 |
| Categorical | 5 |
| DateTime | 3 |
Alerts
| Dataset has 697 (0.3%) duplicate rows | Duplicates |
|---|---|
| action is highly overall correlated with inspection_type | High correlation |
| bbl is highly overall correlated with bin and 5 other fields | High correlation |
| bin is highly overall correlated with bbl and 5 other fields | High correlation |
| boro is highly overall correlated with bbl and 4 other fields | High correlation |
| census_tract is highly overall correlated with bbl and 5 other fields | High correlation |
| community_board is highly overall correlated with bbl and 6 other fields | High correlation |
| council_district is highly overall correlated with bbl and 6 other fields | High correlation |
| critical_flag is highly overall correlated with inspection_type | High correlation |
| inspection_type is highly overall correlated with action and 1 other field | High correlation |
| latitude is highly overall correlated with council_district | High correlation |
| longitude is highly overall correlated with census_tract and 2 other fields | High correlation |
| zipcode is highly overall correlated with bbl and 6 other fields | High correlation |
| action is highly imbalanced (85.1%) | Imbalance |
| inspection_type is highly imbalanced (58.6%) | Imbalance |
| zipcode has 2416 (1.0%) missing values | Missing |
| score has 9228 (3.9%) missing values | Missing |
| community_board has 2875 (1.2%) missing values | Missing |
| council_district has 2865 (1.2%) missing values | Missing |
| census_tract has 2865 (1.2%) missing values | Missing |
| bin has 4099 (1.7%) missing values | Missing |
| nta has 2875 (1.2%) missing values | Missing |
| grade has 123298 (51.4%) missing values | Missing |
| grade_date has 131148 (54.7%) missing values | Missing |
| score has 8788 (3.7%) zeros | Zeros |
| latitude has 2416 (1.0%) zeros | Zeros |
| longitude has 2416 (1.0%) zeros | Zeros |
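The imbalance percentages in the alerts are not explained in the report itself. A minimal sketch, assuming the score is one minus the Shannon entropy of the class shares normalized by the log of the number of classes, reproduces the 85.1% reported for action. The function name `imbalance_score` is hypothetical, and the counts are taken from the action variable's Common Values table.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), with H the Shannon entropy of the class shares.

    Assumed definition: 0 means perfectly balanced classes, values near 1
    mean a single class dominates.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Counts for the five distinct values of `action`.
action_counts = [227528, 8869, 1886, 1383, 5]
action_imbalance = imbalance_score(action_counts)
print(f"action imbalance: {action_imbalance:.1%}")
```

The same check cannot be repeated for inspection_type from this report alone, because only 10 of its 31 category counts are listed.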
Reproduction
| Analysis started | 2024-11-19 06:02:50.685275 |
|---|---|
| Analysis finished | 2024-11-19 07:52:40.449030 |
| Duration | 1 hour, 49 minutes and 49.76 seconds |
| Software version | ydata-profiling v4.12.0 |
| Download configuration | config.json |
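The Duration row follows directly from the two timestamps above it. The sketch below recomputes it with the standard library only; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2024-11-19 06:02:50.685275")
finished = datetime.fromisoformat("2024-11-19 07:52:40.449030")

duration = finished - started
hours, remainder = divmod(duration.total_seconds(), 3600)
minutes, seconds = divmod(remainder, 60)
print(f"{int(hours)} hour, {int(minutes)} minutes and {seconds:.2f} seconds")
```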
Variables
camis
Real number (ℝ)
| Distinct | 26225 |
|---|---|
| Distinct (%) | 10.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 47702580 |
| Minimum | 30075445 |
| Maximum | 50159091 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 30075445 |
|---|---|
| 5-th percentile | 40673117 |
| Q1 | 41686665 |
| median | 50074627 |
| Q3 | 50113280 |
| 95-th percentile | 50142413 |
| Maximum | 50159091 |
| Range | 20083646 |
| Interquartile range (IQR) | 8426615 |
Descriptive statistics
| Standard deviation | 3957181.2 |
|---|---|
| Coefficient of variation (CV) | 0.082955286 |
| Kurtosis | -0.83362953 |
| Mean | 47702580 |
| Median Absolute Deviation (MAD) | 50524 |
| Skewness | -1.0638185 |
| Sum | 1.1432925 × 10^13 |
| Variance | 1.5659283 × 10^13 |
| Monotonicity | Not monotonic |
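Several rows in the camis tables are derived from others, so the figures can be cross-checked. The sketch below assumes the usual definitions (range = max − min, IQR = Q3 − Q1, CV = standard deviation / mean, variance = standard deviation squared). The inputs are the rounded values printed in the report, so agreement holds only to the displayed precision.

```python
# Values copied from the camis statistics tables.
minimum, maximum = 30075445, 50159091
q1, q3 = 41686665, 50113280
mean, std = 47702580, 3957181.2

value_range = maximum - minimum   # reported: 20083646
iqr = q3 - q1                     # reported: 8426615
cv = std / mean                   # reported: 0.082955286
variance = std ** 2               # reported: 1.5659283 × 10^13

print(f"Range:    {value_range}")
print(f"IQR:      {iqr}")
print(f"CV:       {cv:.6f}")
print(f"Variance: {variance:.7e}")
```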
| Value | Count | Frequency (%) |
| 40365904 | 66 | < 0.1% |
| 50111296 | 64 | < 0.1% |
| 50105561 | 64 | < 0.1% |
| 50123073 | 63 | < 0.1% |
| 50089474 | 61 | < 0.1% |
| 41406895 | 58 | < 0.1% |
| 50079599 | 56 | < 0.1% |
| 40714228 | 53 | < 0.1% |
| 50128889 | 51 | < 0.1% |
| 50111191 | 49 | < 0.1% |
| Other values (26215) | 239086 |
| Value | Count | Frequency (%) |
| 30075445 | 16 | |
| 30191841 | 5 | < 0.1% |
| 40356018 | 3 | < 0.1% |
| 40356483 | 22 | |
| 40356731 | 8 | < 0.1% |
| 40357217 | 4 | < 0.1% |
| 40359480 | 4 | < 0.1% |
| 40359705 | 13 | |
| 40360045 | 8 | < 0.1% |
| 40361618 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 50159091 | 1 | < 0.1% |
| 50158928 | 6 | |
| 50158842 | 4 | < 0.1% |
| 50158815 | 13 | |
| 50158791 | 2 | < 0.1% |
| 50158779 | 4 | < 0.1% |
| 50158777 | 6 | |
| 50158741 | 1 | < 0.1% |
| 50158730 | 3 | < 0.1% |
| 50158722 | 3 | < 0.1% |
dba
Text
| Distinct | 20890 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 100 |
|---|---|
| Median length | 63 |
| Mean length | 16.032524 |
| Min length | 2 |
Unique
| Unique | 777 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | LA AURORA |
|---|---|
| 2nd row | MAMA'S RESTAURANT |
| 3rd row | NY 99 CENTS FRESH PIZZA |
| 4th row | AUTHENTIC FLAVAZ |
| 5th row | SUSHI TIME |
| Value | Count | Frequency (%) |
| restaurant | 26852 | 4.2% |
| | 21354 | 3.3% |
| cafe | 12630 | 2.0% |
| pizza | 11377 | 1.8% |
| bar | 9830 | 1.5% |
| bakery | 9043 | 1.4% |
| the | 8509 | 1.3% |
| coffee | 7370 | 1.1% |
| grill | 6225 | 1.0% |
| dunkin | 5119 | 0.8% |
| Other values (14456) | 522921 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 402433 | 10.5% |
| A | 382411 | 10.0% |
| E | 345814 | 9.0% |
| R | 241271 | 6.3% |
| N | 235867 | 6.1% |
| I | 228027 | 5.9% |
| S | 216458 | 5.6% |
| O | 204645 | 5.3% |
| T | 202523 | 5.3% |
| L | 149821 | 3.9% |
| Other values (84) | 1233261 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3842531 |
boro
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.8095765 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Queens |
|---|---|
| 2nd row | Queens |
| 3rd row | Manhattan |
| 4th row | Brooklyn |
| 5th row | Queens |
Common Values
| Value | Count | Frequency (%) |
| Manhattan | 87929 | 36.7% |
| Brooklyn | 65128 | 27.2% |
| Queens | 56338 | 23.5% |
| Bronx | 21534 | 9.0% |
| Staten Island | 8742 | 3.6% |
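The Length statistics for boro can be reproduced from the Common Values counts, since each row holds exactly one borough name. The sketch below assumes that length is counted in characters, spaces included.

```python
# Counts copied from the boro Common Values table.
boro_counts = {
    "Manhattan": 87929,
    "Brooklyn": 65128,
    "Queens": 56338,
    "Bronx": 21534,
    "Staten Island": 8742,
}

total_rows = sum(boro_counts.values())
# Each row contributes len(name) characters, including the space in "Staten Island".
total_chars = sum(len(name) * count for name, count in boro_counts.items())
mean_length = total_chars / total_rows

print(f"Rows:        {total_rows}")
print(f"Characters:  {total_chars}")
print(f"Mean length: {mean_length:.7f}")
```

The character total equals the 1871729 reported under "Most occurring categories" for this variable, and the mean agrees with the reported 7.8095765.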
| Value | Count | Frequency (%) |
| manhattan | 87929 | |
| brooklyn | 65128 | |
| queens | 56338 | |
| bronx | 21534 | 8.7% |
| staten | 8742 | 3.5% |
| island | 8742 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 336342 | |
| a | 281271 | |
| t | 193342 | |
| o | 151790 | |
| e | 121418 | 6.5% |
| M | 87929 | 4.7% |
| h | 87929 | 4.7% |
| B | 86662 | 4.6% |
| r | 86662 | 4.6% |
| l | 73870 | 3.9% |
| Other values (10) | 364514 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1871729 |
building
Text
| Distinct | 7357 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 246 |
| Missing (%) | 0.1% |
| Memory size | 1.8 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 3.4706735 |
| Min length | 1 |
Unique
| Unique | 87 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 23917 |
|---|---|
| 2nd row | 3708 |
| 3rd row | 12 |
| 4th row | 1377 |
| 5th row | 7242 |
| Value | Count | Frequency (%) |
| 1 | 1097 | 0.5% |
| 200 | 752 | 0.3% |
| 2 | 671 | 0.3% |
| 25 | 645 | 0.3% |
| 55 | 609 | 0.3% |
| 20 | 579 | 0.2% |
| 100 | 578 | 0.2% |
| 11 | 568 | 0.2% |
| 30 | 565 | 0.2% |
| 10 | 535 | 0.2% |
| Other values (7325) | 233100 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 154824 | |
| 2 | 108233 | |
| 0 | 93214 | |
| 3 | 83432 | |
| 5 | 77692 | |
| 4 | 75449 | |
| 6 | 62372 | |
| 7 | 59264 | 7.1% |
| 8 | 55386 | 6.7% |
| 9 | 50433 | 6.1% |
| Other values (27) | 10667 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 830966 |
street
Text
| Distinct | 2295 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 37 |
| Mean length | 12.943105 |
| Min length | 5 |
Unique
| Unique | 44 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | BRADDOCK AVE |
|---|---|
| 2nd row | 73RD ST |
| 3rd row | PERRY STREET |
| 4th row | EAST NEW YORK AVENUE |
| 5th row | AUSTIN ST |
| Value | Count | Frequency (%) |
| avenue | 95872 | 18.2% |
| street | 66391 | 12.6% |
| west | 18020 | 3.4% |
| east | 16843 | 3.2% |
| ave | 16725 | 3.2% |
| blvd | 11429 | 2.2% |
| broadway | 10127 | 1.9% |
| boulevard | 9013 | 1.7% |
| st | 8733 | 1.7% |
| road | 6692 | 1.3% |
| Other values (1426) | 265730 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 495538 | |
| (space) | 343375 | |
| T | 268742 | 8.7% |
| A | 267827 | 8.6% |
| N | 197530 | 6.4% |
| R | 194365 | 6.3% |
| S | 181355 | 5.8% |
| V | 144157 | 4.6% |
| U | 138237 | 4.5% |
| O | 112789 | 3.6% |
| Other values (60) | 758172 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3102087 |
zipcode
Real number (ℝ)
High correlation  Missing 
| Distinct | 219 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2416 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10709.42 |
| Minimum | 10000 |
| Maximum | 12345 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 10000 |
|---|---|
| 5-th percentile | 10003 |
| Q1 | 10023 |
| median | 11101 |
| Q3 | 11231 |
| 95-th percentile | 11416 |
| Maximum | 12345 |
| Range | 2345 |
| Interquartile range (IQR) | 1208 |
Descriptive statistics
| Standard deviation | 593.05841 |
|---|---|
| Coefficient of variation (CV) | 0.055377268 |
| Kurtosis | -1.815696 |
| Mean | 10709.42 |
| Median Absolute Deviation (MAD) | 334 |
| Skewness | -0.10268631 |
| Sum | 2.5408634 × 10^9 |
| Variance | 351718.28 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10003 | 5484 | 2.3% |
| 10013 | 5434 | 2.3% |
| 10019 | 4817 | 2.0% |
| 10001 | 4762 | 2.0% |
| 10036 | 4528 | 1.9% |
| 11354 | 4513 | 1.9% |
| 10002 | 4400 | 1.8% |
| 11201 | 4277 | 1.8% |
| 11220 | 3938 | 1.6% |
| 11372 | 3854 | 1.6% |
| Other values (209) | 191248 |
| Value | Count | Frequency (%) |
| 10000 | 18 | < 0.1% |
| 10001 | 4762 | |
| 10002 | 4400 | |
| 10003 | 5484 | |
| 10004 | 1163 | 0.5% |
| 10005 | 659 | 0.3% |
| 10006 | 392 | 0.2% |
| 10007 | 1395 | 0.6% |
| 10009 | 2552 | |
| 10010 | 2051 | 0.9% |
| Value | Count | Frequency (%) |
| 12345 | 7 | < 0.1% |
| 11697 | 16 | < 0.1% |
| 11694 | 246 | 0.1% |
| 11693 | 223 | 0.1% |
| 11692 | 87 | < 0.1% |
| 11691 | 420 | 0.2% |
| 11436 | 223 | 0.1% |
| 11435 | 1172 | |
| 11434 | 869 | |
| 11433 | 241 | 0.1% |
phone
Text
| Distinct | 24101 |
|---|---|
| Distinct (%) | 10.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 636 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 7183474271 |
|---|---|
| 2nd row | 3478917275 |
| 3rd row | 9172922325 |
| 4th row | 7189751121 |
| 5th row | 3479560001 |
| Value | Count | Frequency (%) |
| 2126159700 | 144 | 0.1% |
| 7182246030 | 131 | 0.1% |
| 2122441111 | 126 | 0.1% |
| 9082308846 | 115 | < 0.1% |
| 3477017760 | 105 | < 0.1% |
| 9178863304 | 103 | < 0.1% |
| 9175103862 | 103 | < 0.1% |
| 9177709055 | 92 | < 0.1% |
| 7043285184 | 91 | < 0.1% |
| 9175878888 | 90 | < 0.1% |
| Other values (24091) | 238571 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 310220 | |
| 1 | 306546 | |
| 2 | 294454 | |
| 8 | 284208 | |
| 6 | 224774 | |
| 4 | 213958 | |
| 3 | 202962 | |
| 9 | 201212 | |
| 0 | 188824 | |
| 5 | 168398 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2396710 |
cuisine_description
Text
| Distinct | 89 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 30 |
|---|---|
| Median length | 24 |
| Mean length | 9.6617738 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Spanish |
|---|---|
| 2nd row | Bangladeshi |
| 3rd row | Pizza |
| 4th row | Caribbean |
| 5th row | Japanese |
| Value | Count | Frequency (%) |
| american | 48343 | 16.9% |
| chinese | 23443 | 8.2% |
| coffee/tea | 16865 | 5.9% |
| pizza | 14865 | 5.2% |
| latin | 10128 | 3.5% |
| mexican | 9721 | 3.4% |
| bakery | 9712 | 3.4% |
| products/desserts | 9712 | 3.4% |
| caribbean | 8897 | 3.1% |
| japanese | 8099 | 2.8% |
| Other values (90) | 127018 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 310597 | 13.4% |
| a | 241972 | 10.4% |
| i | 200604 | 8.7% |
| n | 192637 | 8.3% |
| s | 151929 | 6.6% |
| r | 137964 | 6.0% |
| c | 89710 | 3.9% |
| t | 73919 | 3.2% |
| h | 69611 | 3.0% |
| o | 66994 | 2.9% |
| Other values (41) | 779710 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2315647 |
inspection_date
Date
| Distinct | 1271 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Minimum | 2019-01-03 00:00:00 |
|---|---|
| Maximum | 2024-09-30 00:00:00 |
action
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 130 |
|---|---|
| Median length | 47 |
| Mean length | 50.030191 |
| Min length | 33 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Violations were cited in the following area(s). |
|---|---|
| 2nd row | Violations were cited in the following area(s). |
| 3rd row | Violations were cited in the following area(s). |
| 4th row | Violations were cited in the following area(s). |
| 5th row | Violations were cited in the following area(s). |
Common Values
| Value | Count | Frequency (%) |
| Violations were cited in the following area(s). | 227528 | 94.9% |
| Establishment Closed by DOHMH. Violations were cited in the following area(s) and those requiring immediate action were addressed. | 8869 | 3.7% |
| Establishment re-opened by DOHMH. | 1886 | 0.8% |
| No violations were recorded at the time of this inspection. | 1383 | 0.6% |
| Establishment re-closed by DOHMH. | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| were | 246649 | |
| violations | 237780 | |
| the | 237780 | |
| cited | 236397 | |
| in | 236397 | |
| following | 236397 | |
| area(s | 236397 | |
| establishment | 10760 | 0.6% |
| by | 10760 | 0.6% |
| dohmh | 10760 | 0.6% |
| Other values (16) | 73655 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1534061 | |
| e | 1287915 | |
| i | 1245388 | |
| o | 982384 | 8.2% |
| t | 765616 | 6.4% |
| a | 758193 | 6.3% |
| n | 752593 | 6.3% |
| l | 730208 | 6.1% |
| s | 533944 | 4.5% |
| r | 514310 | 4.3% |
| Other values (25) | 2886174 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11990786 |
violation_code
Text
| Distinct | 138 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1373 |
| Missing (%) | 0.6% |
| Memory size | 1.8 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.0718974 |
| Min length | 3 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 04L |
|---|---|
| 2nd row | 04A |
| 3rd row | 10G |
| 4th row | 08A |
| 5th row | 04L |
| Value | Count | Frequency (%) |
| 10f | 33519 | |
| 08a | 24495 | 10.3% |
| 06d | 15965 | 6.7% |
| 02g | 14958 | 6.3% |
| 04l | 14425 | 6.1% |
| 10b | 14264 | 6.0% |
| 06c | 13828 | 5.8% |
| 02b | 12623 | 5.3% |
| 04n | 10862 | 4.6% |
| 04a | 7063 | 3.0% |
| Other values (128) | 76296 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 241140 | |
| 1 | 65307 | 8.9% |
| 4 | 50790 | 6.9% |
| 6 | 46929 | 6.4% |
| 2 | 39086 | 5.3% |
| A | 38989 | 5.3% |
| F | 38700 | 5.3% |
| 8 | 32984 | 4.5% |
| B | 31256 | 4.3% |
| C | 23739 | 3.2% |
| Other values (17) | 123107 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 732027 |
violation_description
Text
| Distinct | 216 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 1373 |
| Missing (%) | 0.6% |
| Memory size | 1.8 MiB |
Length
| Max length | 952 |
|---|---|
| Median length | 305 |
| Mean length | 157.88182 |
| Min length | 19 |
Unique
| Unique | 12 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Evidence of mice or live mice in establishment's food or non-food areas. |
|---|---|
| 2nd row | Food Protection Certificate (FPC) not held by manager or supervisor of food operations. |
| 3rd row | Dishwashing and ware washing: Cleaning and sanitizing of tableware, including dishes, utensils, and equipment deficient. |
| 4th row | Establishment is not free of harborage or conditions conducive to rodents, insects or other pests. |
| 5th row | Evidence of mice or live mice in establishment's food or non-food areas. |
| Value | Count | Frequency (%) |
| or | 420324 | 7.6% |
| not | 258078 | 4.7% |
| food | 142548 | 2.6% |
| and | 131760 | 2.4% |
| flies | 113384 | 2.0% |
| of | 110335 | 2.0% |
| above | 101696 | 1.8% |
| to | 92850 | 1.7% |
| properly | 88322 | 1.6% |
| in | 82550 | 1.5% |
| Other values (844) | 4005115 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 5333481 | |
| e | 3589796 | 9.5% |
| o | 3108525 | 8.3% |
| n | 2388393 | 6.3% |
| a | 2342883 | 6.2% |
| i | 2298441 | 6.1% |
| r | 2244760 | 6.0% |
| t | 2211686 | 5.9% |
| s | 1836395 | 4.9% |
| d | 1664974 | 4.4% |
| Other values (66) | 10603589 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 37622923 |
critical_flag
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 9.8442198 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Critical |
|---|---|
| 2nd row | Critical |
| 3rd row | Not Critical |
| 4th row | Not Critical |
| 5th row | Critical |
Common Values
| Value | Count | Frequency (%) |
| Critical | 130539 | 54.5% |
| Not Critical | 106393 | 44.4% |
| Not Applicable | 2739 | 1.1% |
| Value | Count | Frequency (%) |
| critical | 236932 | |
| not | 109132 | |
| applicable | 2739 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 476603 | |
| t | 346064 | |
| l | 242410 | |
| c | 239671 | |
| a | 239671 | |
| r | 236932 | |
| C | 236932 | |
| N | 109132 | 4.6% |
| o | 109132 | 4.6% |
| (space) | 109132 | 4.6% |
| Other values (4) | 13695 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2359374 |
score
Real number (ℝ)
Missing  Zeros 
| Distinct | 136 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 9228 |
| Missing (%) | 3.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.951398 |
| Minimum | 0 |
| Maximum | 168 |
| Zeros | 8788 |
| Zeros (%) | 3.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 20 |
| Q3 | 32 |
| 95-th percentile | 60 |
| Maximum | 168 |
| Range | 168 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 18.17703 |
|---|---|
| Coefficient of variation (CV) | 0.7589131 |
| Kurtosis | 4.0173809 |
| Mean | 23.951398 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 1.6168023 |
| Sum | 5519432 |
| Variance | 330.4044 |
| Monotonicity | Not monotonic |
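The score statistics are internally consistent if the mean is taken over non-missing rows only. The sketch below checks this under that assumption, using the dataset's 239671 observations and the 9228 missing scores reported above.

```python
# Values copied from the report.
n_observations = 239_671
score_missing = 9_228
score_sum = 5_519_432
score_std = 18.17703

# Assumption: missing scores are excluded from the mean, not treated as zero.
non_missing = n_observations - score_missing
score_mean = score_sum / non_missing     # reported: 23.951398
score_cv = score_std / score_mean        # reported: 0.7589131

print(f"Non-missing rows: {non_missing}")
print(f"Mean score:       {score_mean:.6f}")
print(f"CV:               {score_cv:.7f}")
```

Note that the 8788 zero scores are real values and do enter the mean; only the missing entries are left out.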
| Value | Count | Frequency (%) |
| 12 | 19985 | 8.3% |
| 13 | 18657 | 7.8% |
| 10 | 9599 | 4.0% |
| 11 | 8909 | 3.7% |
| 0 | 8788 | 3.7% |
| 9 | 8295 | 3.5% |
| 7 | 6248 | 2.6% |
| 27 | 6207 | 2.6% |
| 22 | 5619 | 2.3% |
| 26 | 5594 | 2.3% |
| Other values (126) | 132542 | |
| (Missing) | 9228 | 3.9% |
| Value | Count | Frequency (%) |
| 0 | 8788 | |
| 2 | 2736 | 1.1% |
| 3 | 1035 | 0.4% |
| 4 | 2180 | 0.9% |
| 5 | 3121 | 1.3% |
| 6 | 1277 | 0.5% |
| 7 | 6248 | |
| 8 | 3252 | 1.4% |
| 9 | 8295 | |
| 10 | 9599 |
| Value | Count | Frequency (%) |
| 168 | 9 | < 0.1% |
| 154 | 12 | < 0.1% |
| 153 | 20 | |
| 143 | 14 | < 0.1% |
| 142 | 38 | |
| 141 | 9 | < 0.1% |
| 138 | 27 | |
| 136 | 24 | |
| 134 | 9 | < 0.1% |
| 132 | 15 | < 0.1% |
record_date
Date
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
| Minimum | 2024-11-18 06:00:10 |
|---|---|
| Maximum | 2024-11-18 06:00:12 |
inspection_type
Categorical
High correlation  Imbalance 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.8 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 37 |
| Mean length | 38.089018 |
| Min length | 25 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Cycle Inspection / Initial Inspection |
|---|---|
| 2nd row | Pre-permit (Operational) / Re-inspection |
| 3rd row | Cycle Inspection / Initial Inspection |
| 4th row | Cycle Inspection / Initial Inspection |
| 5th row | Cycle Inspection / Initial Inspection |
Common Values
| Value | Count | Frequency (%) |
| Cycle Inspection / Initial Inspection | 130860 | 54.6% |
| Cycle Inspection / Re-inspection | 44258 | 18.5% |
| Pre-permit (Operational) / Initial Inspection | 36182 | 15.1% |
| Pre-permit (Operational) / Re-inspection | 10203 | 4.3% |
| Administrative Miscellaneous / Initial Inspection | 6330 | 2.6% |
| Pre-permit (Non-operational) / Initial Inspection | 3106 | 1.3% |
| Pre-permit (Operational) / Compliance Inspection | 1718 | 0.7% |
| Cycle Inspection / Reopening Inspection | 1444 | 0.6% |
| Administrative Miscellaneous / Re-inspection | 1213 | 0.5% |
| Cycle Inspection / Compliance Inspection | 991 | 0.4% |
| Other values (21) | 3366 | 1.4% |
Words
| Value | Count | Frequency (%) |
| inspection | 361176 | 31.6% |
| / | 239671 | 21.0% |
| initial | 178203 | 15.6% |
| cycle | 177611 | 15.5% |
| re-inspection | 56106 | 4.9% |
| pre-permit | 52416 | 4.6% |
| operational | 49007 | 4.3% |
| administrative | 7731 | 0.7% |
| miscellaneous | 7731 | 0.7% |
| non-operational | 3409 | 0.3% |
| Other values (14) | 10430 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1093646 | 12.0% |
| i | 972343 | 10.7% |
| (space) | 903820 | 9.9% |
| e | 842649 | 9.2% |
| t | 717695 | 7.9% |
| c | 607431 | 6.7% |
| I | 539795 | 5.9% |
| p | 527473 | 5.8% |
| o | 491710 | 5.4% |
| s | 441822 | 4.8% |
| Other values (25) | 1990449 | 21.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 9128833 | 100.0% |
latitude
Real number (ℝ)
High correlation  Zeros 
| Distinct | 22108 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 253 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.315141 |
| Minimum | 0 |
|---|---|
| Maximum | 40.912822 |
| Zeros | 2416 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.60014 |
| Q1 | 40.682931 |
| median | 40.729807 |
| Q3 | 40.760697 |
| 95-th percentile | 40.850955 |
| Maximum | 40.912822 |
| Range | 40.912822 |
| Interquartile range (IQR) | 0.077766154 |
Descriptive statistics
| Standard deviation | 4.0710132 |
|---|---|
| Coefficient of variation (CV) | 0.10097976 |
| Kurtosis | 94.054545 |
| Mean | 40.315141 |
| Median Absolute Deviation (MAD) | 0.035899198 |
| Skewness | -9.7992694 |
| Sum | 9652170.3 |
| Variance | 16.573148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2416 | 1.0% |
| 40.64831283 | 348 | 0.1% |
| 40.58229742 | 332 | 0.1% |
| 40.75977791 | 327 | 0.1% |
| 40.73384018 | 297 | 0.1% |
| 40.69082623 | 244 | 0.1% |
| 40.758502 | 224 | 0.1% |
| 40.74186904 | 224 | 0.1% |
| 40.86590541 | 178 | 0.1% |
| 40.60992885 | 175 | 0.1% |
| Other values (22098) | 234653 | 97.9% |
| (Missing) | 253 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2416 | 1.0% |
| 40.49956271 | 5 | < 0.1% |
| 40.50806852 | 4 | < 0.1% |
| 40.50911465 | 11 | < 0.1% |
| 40.50917535 | 8 | < 0.1% |
| 40.5099219 | 5 | < 0.1% |
| 40.50993835 | 8 | < 0.1% |
| 40.50999021 | 1 | < 0.1% |
| 40.51062322 | 8 | < 0.1% |
| 40.51075743 | 17 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.91282233 | 14 | < 0.1% |
| 40.91047336 | 2 | < 0.1% |
| 40.91001188 | 14 | < 0.1% |
| 40.90982301 | 15 | < 0.1% |
| 40.90732865 | 9 | < 0.1% |
| 40.90722436 | 7 | < 0.1% |
| 40.90688982 | 7 | < 0.1% |
| 40.90671944 | 5 | < 0.1% |
| 40.90664757 | 10 | < 0.1% |
| 40.9065722 | 13 | < 0.1% |
longitude
Real number (ℝ)
High correlation  Zeros 
| Distinct | 22108 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 253 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.195859 |
| Minimum | -74.248708 |
|---|---|
| Maximum | 0 |
| Zeros | 2416 |
| Zeros (%) | 1.0% |
| Negative | 237002 |
| Negative (%) | 98.9% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | -74.248708 |
|---|---|
| 5-th percentile | -74.016176 |
| Q1 | -73.989255 |
| median | -73.957175 |
| Q3 | -73.897027 |
| 95-th percentile | -73.79157 |
| Maximum | 0 |
| Range | 74.248708 |
| Interquartile range (IQR) | 0.092228584 |
Descriptive statistics
| Standard deviation | 7.3906524 |
|---|---|
| Coefficient of variation (CV) | -0.10097091 |
| Kurtosis | 94.088567 |
| Mean | -73.195859 |
| Median Absolute Deviation (MAD) | 0.038398436 |
| Skewness | 9.8018994 |
| Sum | -17524406 |
| Variance | 54.621743 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2416 | 1.0% |
| -73.7882815 | 348 | 0.1% |
| -74.16905259 | 332 | 0.1% |
| -73.82923543 | 327 | 0.1% |
| -73.87157703 | 297 | 0.1% |
| -73.98345225 | 244 | 0.1% |
| -73.83324181 | 224 | 0.1% |
| -74.00471301 | 224 | 0.1% |
| -73.83042975 | 178 | 0.1% |
| -73.92228162 | 175 | 0.1% |
| Other values (22098) | 234653 | 97.9% |
| (Missing) | 253 | 0.1% |
| Value | Count | Frequency (%) |
| -74.24870792 | 6 | < 0.1% |
| -74.24850215 | 17 | < 0.1% |
| -74.24843447 | 2 | < 0.1% |
| -74.24837218 | 8 | < 0.1% |
| -74.24801199 | 10 | < 0.1% |
| -74.24661164 | 11 | < 0.1% |
| -74.24646442 | 8 | < 0.1% |
| -74.24392834 | 8 | < 0.1% |
| -74.24392109 | 5 | < 0.1% |
| -74.24266267 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2416 | 1.0% |
| -73.70092806 | 11 | < 0.1% |
| -73.70171187 | 9 | < 0.1% |
| -73.70268132 | 13 | < 0.1% |
| -73.70269217 | 1 | < 0.1% |
| -73.70272112 | 13 | < 0.1% |
| -73.70272827 | 7 | < 0.1% |
| -73.70274635 | 11 | < 0.1% |
| -73.70276092 | 16 | < 0.1% |
| -73.70278986 | 21 | < 0.1% |
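Both latitude and longitude report 2416 zeros, while every other value falls between roughly 40.50 and 40.91 for latitude and between -74.25 and -73.70 for longitude. One plausible reading, which the report does not confirm, is that 0 is a placeholder for an address that was never geocoded. The Python sketch below works under that assumption; `clean_coordinate` is a hypothetical helper and the bounds are copied from the extreme-value tables above.

```python
from typing import Optional, Tuple

# Bounds copied from the smallest and largest non-zero values in the tables above.
LAT_BOUNDS = (40.49956271, 40.91282233)
LON_BOUNDS = (-74.24870792, -73.70092806)


def clean_coordinate(
    lat: Optional[float], lon: Optional[float]
) -> Optional[Tuple[float, float]]:
    """Return (lat, lon) if both fall inside the bounds, otherwise None."""
    if lat is None or lon is None:
        return None
    # NaN fails both comparisons, so it is rejected here as well.
    if not (LAT_BOUNDS[0] <= lat <= LAT_BOUNDS[1]):
        return None
    if not (LON_BOUNDS[0] <= lon <= LON_BOUNDS[1]):
        return None
    return (lat, lon)


# Effect on the latitude mean if the 2416 zeros are excluded (assumption).
lat_sum = 9652170.3            # "Sum" reported for latitude
non_missing = 239671 - 253     # observations minus missing values
mean_without_zeros = lat_sum / (non_missing - 2416)

print(clean_coordinate(0.0, 0.0))
print(clean_coordinate(40.726551, -73.728606))
print(round(mean_without_zeros, 3))
```

Under this assumption the latitude mean moves from the reported 40.315 to about 40.726, close to the reported median, which suggests the zeros account for most of the skewness and kurtosis shown above.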
community_board
Real number (ℝ)
High correlation  Missing 
| Distinct | 69 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2875 |
| Missing (%) | 1.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 255.23934 |
| Minimum | 101 |
|---|---|
| Maximum | 595 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 101 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 106 |
| median | 302 |
| Q3 | 401 |
| 95-th percentile | 413 |
| Maximum | 595 |
| Range | 494 |
| Interquartile range (IQR) | 295 |
Descriptive statistics
| Standard deviation | 129.79592 |
|---|---|
| Coefficient of variation (CV) | 0.50852632 |
| Kurtosis | -1.4134476 |
| Mean | 255.23934 |
| Median Absolute Deviation (MAD) | 105 |
| Skewness | 0.076842019 |
| Sum | 60439654 |
| Variance | 16846.981 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 105 | 17739 | 7.4% |
| 103 | 11799 | 4.9% |
| 102 | 10846 | 4.5% |
| 104 | 9273 | 3.9% |
| 407 | 8976 | 3.7% |
| 301 | 8066 | 3.4% |
| 101 | 7136 | 3.0% |
| 106 | 7108 | 3.0% |
| 401 | 6800 | 2.8% |
| 108 | 6640 | 2.8% |
| Other values (59) | 142413 | 59.4% |
| Value | Count | Frequency (%) |
| 101 | 7136 | 3.0% |
| 102 | 10846 | 4.5% |
| 103 | 11799 | 4.9% |
| 104 | 9273 | 3.9% |
| 105 | 17739 | 7.4% |
| 106 | 7108 | 3.0% |
| 107 | 5273 | 2.2% |
| 108 | 6640 | 2.8% |
| 109 | 2094 | 0.9% |
| 110 | 1718 | 0.7% |
| Value | Count | Frequency (%) |
| 595 | 5 | < 0.1% |
| 503 | 2079 | 0.9% |
| 502 | 2954 | 1.2% |
| 501 | 3624 | 1.5% |
| 483 | 256 | 0.1% |
| 482 | 4 | < 0.1% |
| 481 | 47 | < 0.1% |
| 480 | 51 | < 0.1% |
| 414 | 992 | 0.4% |
| 413 | 2865 | 1.2% |
council_district
Real number (ℝ)
High correlation  Missing 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2865 |
| Missing (%) | 1.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.987893 |
| Minimum | 1 |
|---|---|
| Maximum | 51 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 20 |
| Q3 | 35 |
| 95-th percentile | 48 |
| Maximum | 51 |
| Range | 50 |
| Interquartile range (IQR) | 31 |
Descriptive statistics
| Standard deviation | 15.877508 |
|---|---|
| Coefficient of variation (CV) | 0.75650795 |
| Kurtosis | -1.3187543 |
| Mean | 20.987893 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.22742685 |
| Sum | 4970059 |
| Variance | 252.09526 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 19147 | 8.0% |
| 1 | 17632 | 7.4% |
| 4 | 16096 | 6.7% |
| 2 | 12410 | 5.2% |
| 33 | 7795 | 3.3% |
| 20 | 7206 | 3.0% |
| 34 | 6985 | 2.9% |
| 26 | 6708 | 2.8% |
| 39 | 6385 | 2.7% |
| 43 | 5579 | 2.3% |
| Other values (41) | 130863 | 54.6% |
| Value | Count | Frequency (%) |
| 1 | 17632 | 7.4% |
| 2 | 12410 | 5.2% |
| 3 | 19147 | 8.0% |
| 4 | 16096 | 6.7% |
| 5 | 4596 | 1.9% |
| 6 | 4968 | 2.1% |
| 7 | 3538 | 1.5% |
| 8 | 3560 | 1.5% |
| 9 | 2245 | 0.9% |
| 10 | 3647 | 1.5% |
| Value | Count | Frequency (%) |
| 51 | 2585 | 1.1% |
| 50 | 3129 | 1.3% |
| 49 | 3079 | 1.3% |
| 48 | 3176 | 1.3% |
| 47 | 3319 | 1.4% |
| 46 | 2656 | 1.1% |
| 45 | 2654 | 1.1% |
| 44 | 2010 | 0.8% |
| 43 | 5579 | 2.3% |
| 42 | 1576 | 0.7% |
census_tract
Real number (ℝ)
High correlation  Missing 
| Distinct | 1174 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 2865 |
| Missing (%) | 1.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29849.808 |
| Minimum | 100 |
|---|---|
| Maximum | 162100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 1800 |
| Q1 | 8000 |
| median | 17300 |
| Q3 | 42300 |
| 95-th percentile | 94600 |
| Maximum | 162100 |
| Range | 162000 |
| Interquartile range (IQR) | 34300 |
Descriptive statistics
| Standard deviation | 31273.797 |
|---|---|
| Coefficient of variation (CV) | 1.0477051 |
| Kurtosis | 2.6022362 |
| Mean | 29849.808 |
| Median Absolute Deviation (MAD) | 11600 |
| Skewness | 1.6826656 |
| Sum | 7.0686137 × 10⁹ |
| Variance | 9.7805035 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 87100 | 2185 | 0.9% |
| 6500 | 1952 | 0.8% |
| 2100 | 1826 | 0.8% |
| 2900 | 1810 | 0.8% |
| 4100 | 1768 | 0.7% |
| 700 | 1687 | 0.7% |
| 3800 | 1644 | 0.7% |
| 3300 | 1530 | 0.6% |
| 7600 | 1487 | 0.6% |
| 900 | 1472 | 0.6% |
| Other values (1164) | 219445 | 91.6% |
| (Missing) | 2865 | 1.2% |
| Value | Count | Frequency (%) |
| 100 | 435 | 0.2% |
| 200 | 265 | 0.1% |
| 201 | 34 | < 0.1% |
| 202 | 67 | < 0.1% |
| 300 | 376 | 0.2% |
| 301 | 38 | < 0.1% |
| 400 | 89 | < 0.1% |
| 500 | 3 | < 0.1% |
| 501 | 107 | < 0.1% |
| 502 | 160 | 0.1% |
| Value | Count | Frequency (%) |
| 162100 | 43 | < 0.1% |
| 161700 | 197 | 0.1% |
| 157903 | 22 | < 0.1% |
| 157902 | 134 | 0.1% |
| 157901 | 93 | < 0.1% |
| 157101 | 100 | < 0.1% |
| 155102 | 180 | 0.1% |
| 155101 | 34 | < 0.1% |
| 152902 | 49 | < 0.1% |
| 152901 | 48 | < 0.1% |
bin
Real number (ℝ)
High correlation  Missing 
| Distinct | 19235 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 4099 |
| Missing (%) | 1.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2584827.6 |
| Minimum | 1000000 |
|---|---|
| Maximum | 5799501 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1000000 |
|---|---|
| 5-th percentile | 1005764 |
| Q1 | 1051842 |
| median | 3022822 |
| Q3 | 4007881 |
| 95-th percentile | 4532160 |
| Maximum | 5799501 |
| Range | 4799501 |
| Interquartile range (IQR) | 2956039 |
Descriptive statistics
| Standard deviation | 1345464.7 |
|---|---|
| Coefficient of variation (CV) | 0.52052397 |
| Kurtosis | -1.4336572 |
| Mean | 2584827.6 |
| Median Absolute Deviation (MAD) | 1130022 |
| Skewness | 0.087172864 |
| Sum | 6.08913 × 10¹¹ |
| Variance | 1.8102752 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4000000 | 899 | 0.4% |
| 1000000 | 427 | 0.2% |
| 3000000 | 359 | 0.1% |
| 5039658 | 332 | 0.1% |
| 4113546 | 327 | 0.1% |
| 4045999 | 313 | 0.1% |
| 1012541 | 247 | 0.1% |
| 3397861 | 244 | 0.1% |
| 4112276 | 236 | 0.1% |
| 5000000 | 215 | 0.1% |
| Other values (19225) | 231973 | 96.8% |
| (Missing) | 4099 | 1.7% |
| Value | Count | Frequency (%) |
| 1000000 | 427 | 0.2% |
| 1000003 | 16 | < 0.1% |
| 1000005 | 62 | < 0.1% |
| 1000006 | 14 | < 0.1% |
| 1000008 | 14 | < 0.1% |
| 1000009 | 10 | < 0.1% |
| 1000012 | 13 | < 0.1% |
| 1000014 | 27 | < 0.1% |
| 1000018 | 3 | < 0.1% |
| 1000021 | 13 | < 0.1% |
| Value | Count | Frequency (%) |
| 5799501 | 16 | < 0.1% |
| 5174856 | 4 | < 0.1% |
| 5174558 | 2 | < 0.1% |
| 5171931 | 21 | < 0.1% |
| 5171653 | 58 | < 0.1% |
| 5170926 | 3 | < 0.1% |
| 5170602 | 6 | < 0.1% |
| 5170408 | 2 | < 0.1% |
| 5170220 | 5 | < 0.1% |
| 5170018 | 2 | < 0.1% |
bbl
Real number (ℝ)
High correlation 
| Distinct | 18950 |
|---|---|
| Distinct (%) | 7.9% |
| Missing | 459 |
| Missing (%) | 0.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.478926 × 10⁹ |
| Minimum | 1 |
|---|---|
| Maximum | 5.2700005 × 10⁹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1.00201 × 10⁹ |
| Q1 | 1.0112501 × 10⁹ |
| median | 3.00825 × 10⁹ |
| Q3 | 4.00515 × 10⁹ |
| 95-th percentile | 4.1229201 × 10⁹ |
| Maximum | 5.2700005 × 10⁹ |
| Range | 5.2700005 × 10⁹ |
| Interquartile range (IQR) | 2.9939 × 10⁹ |
Descriptive statistics
| Standard deviation | 1.332146 × 10⁹ |
|---|---|
| Coefficient of variation (CV) | 0.53738838 |
| Kurtosis | -1.344881 |
| Mean | 2.478926 × 10⁹ |
| Median Absolute Deviation (MAD) | 1.04236 × 10⁹ |
| Skewness | 0.067925953 |
| Sum | 5.9298884 × 10¹⁴ |
| Variance | 1.7746131 × 10¹⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1772 | 0.7% |
| 4 | 855 | 0.4% |
| 3 | 513 | 0.2% |
| 2 | 372 | 0.2% |
| 5024000180 | 332 | 0.1% |
| 4050190005 | 327 | 0.1% |
| 4018600100 | 313 | 0.1% |
| 4142600001 | 256 | 0.1% |
| 1007130001 | 247 | 0.1% |
| 3001497501 | 244 | 0.1% |
| Other values (18940) | 233981 | 97.6% |
| (Missing) | 459 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 1772 | 0.7% |
| 2 | 372 | 0.2% |
| 3 | 513 | 0.2% |
| 4 | 855 | 0.4% |
| 5 | 128 | 0.1% |
| 1000000000 | 2 | < 0.1% |
| 1000010010 | 3 | < 0.1% |
| 1000020001 | 86 | < 0.1% |
| 1000020002 | 16 | < 0.1% |
| 1000030001 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 5270000501 | 16 | < 0.1% |
| 5080470016 | 8 | < 0.1% |
| 5080460001 | 8 | < 0.1% |
| 5080430015 | 1 | < 0.1% |
| 5080340020 | 5 | < 0.1% |
| 5080260008 | 6 | < 0.1% |
| 5080260005 | 17 | < 0.1% |
| 5080260003 | 8 | < 0.1% |
| 5080200116 | 13 | < 0.1% |
| 5080200059 | 3 | < 0.1% |
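Most bbl values have ten digits, but the smallest values in the tables above are 1 through 5. Assuming the usual NYC borough-block-lot layout (one borough digit, five block digits, four lot digits), which the report itself does not state, the single-digit entries look like a bare borough code with no block or lot. A minimal Python sketch under that assumption; `split_bbl` is a hypothetical helper, not part of the report.

```python
from typing import Optional, Tuple


def split_bbl(bbl: float) -> Optional[Tuple[int, int, int]]:
    """Split a 10-digit BBL into (borough, block, lot); None if malformed."""
    if bbl != bbl:  # NaN is the only value not equal to itself
        return None
    digits = str(int(bbl))
    if len(digits) != 10:
        return None  # e.g. the single-digit values 1 to 5 seen above
    return int(digits[0]), int(digits[1:6]), int(digits[6:])


print(split_bbl(5024000180))
print(split_bbl(1))
```

Treating the short values as missing before computing statistics would likely change the Minimum and Range rows above, since both are currently driven by the value 1.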
nta
Text
Missing 
| Distinct | 193 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2875 |
| Missing (%) | 1.2% |
| Memory size | 1.8 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | QN43 |
|---|---|
| 2nd row | QN28 |
| 3rd row | MN23 |
| 4th row | BK79 |
| 5th row | QN17 |
| Value | Count | Frequency (%) |
| mn17 | 13285 | 5.6% |
| mn13 | 6934 | 2.9% |
| mn24 | 6770 | 2.9% |
| mn23 | 6646 | 2.8% |
| mn27 | 5402 | 2.3% |
| mn22 | 5203 | 2.2% |
| qn22 | 5096 | 2.2% |
| mn15 | 4489 | 1.9% |
| mn25 | 4436 | 1.9% |
| mn19 | 4139 | 1.7% |
| Other values (183) | 174396 | 73.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 142110 | 15.0% |
| 2 | 89485 | 9.4% |
| M | 86454 | 9.1% |
| B | 86024 | 9.1% |
| 3 | 73119 | 7.7% |
| 1 | 70450 | 7.4% |
| K | 64848 | 6.8% |
| Q | 55656 | 5.9% |
| 7 | 52790 | 5.6% |
| 4 | 39720 | 4.2% |
| Other values (8) | 186528 | 19.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 947184 | 100.0% |
grade
Categorical
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 123298 |
| Missing (%) | 51.4% |
| Memory size | 1.8 MiB |
| A | 80907 |
|---|---|
| B | 14370 |
| C | 9069 |
| N | 7866 |
| Z | 3458 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | A |
| 3rd row | A |
| 4th row | B |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 80907 | 33.8% |
| B | 14370 | 6.0% |
| C | 9069 | 3.8% |
| N | 7866 | 3.3% |
| Z | 3458 | 1.4% |
| P | 703 | 0.3% |
| (Missing) | 123298 | 51.4% |
Words
| Value | Count | Frequency (%) |
| a | 80907 | 69.5% |
| b | 14370 | 12.3% |
| c | 9069 | 7.8% |
| n | 7866 | 6.8% |
| z | 3458 | 3.0% |
| p | 703 | 0.6% |
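The grade tables use two different denominators: the Common Values table reports B as 6.0%, whereas the lowercase table directly above reports 12.3%. The first divides by all 239671 rows; the second appears to divide by the 116373 rows that have a grade. The Python sketch below reproduces both figures from the counts in the tables; `share` is an illustrative helper.

```python
total_rows = 239671
missing = 123298
counts = {"A": 80907, "B": 14370, "C": 9069, "N": 7866, "Z": 3458, "P": 703}

graded = sum(counts.values())  # rows that carry a grade


def share(count: int, denominator: int) -> float:
    """Percentage of `denominator` represented by `count`, to one decimal."""
    return round(100 * count / denominator, 1)


for grade, n in counts.items():
    print(f"{grade}: {share(n, total_rows)}% of all rows, "
          f"{share(n, graded)}% of graded rows")
```

The graded total equals the observation count minus the missing count, so the two sets of percentages describe the same rows.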
Most occurring characters
| Value | Count | Frequency (%) |
| A | 80907 | 69.5% |
| B | 14370 | 12.3% |
| C | 9069 | 7.8% |
| N | 7866 | 6.8% |
| Z | 3458 | 3.0% |
| P | 703 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 116373 | 100.0% |
grade_date
Date
Missing 
| Distinct | 1148 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 131148 |
| Missing (%) | 54.7% |
| Memory size | 1.8 MiB |
| Minimum | 2019-01-03 00:00:00 |
|---|---|
| Maximum | 2024-09-30 00:00:00 |
Interactions
Correlations
| | action | bbl | bin | boro | camis | census_tract | community_board | council_district | critical_flag | grade | inspection_type | latitude | longitude | score | zipcode |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| action | 1.000 | 0.023 | 0.025 | 0.022 | 0.025 | 0.021 | 0.024 | 0.022 | 0.449 | 0.422 | 0.548 | 0.009 | 0.009 | 0.291 | 0.020 |
| bbl | 0.023 | 1.000 | 0.966 | 0.993 | 0.036 | 0.627 | 0.976 | 0.774 | 0.015 | 0.030 | 0.023 | -0.300 | 0.488 | 0.035 | 0.847 |
| bin | 0.025 | 0.966 | 1.000 | 1.000 | 0.041 | 0.577 | 0.957 | 0.755 | 0.014 | 0.033 | 0.027 | -0.360 | 0.497 | 0.031 | 0.835 |
| boro | 0.022 | 0.993 | 1.000 | 1.000 | 0.040 | 0.366 | 1.000 | 0.861 | 0.014 | 0.033 | 0.033 | 0.047 | 0.047 | 0.041 | 0.762 |
| camis | 0.025 | 0.036 | 0.041 | 0.040 | 1.000 | 0.021 | 0.033 | 0.031 | 0.014 | 0.123 | 0.232 | -0.012 | 0.009 | 0.138 | 0.031 |
| census_tract | 0.021 | 0.627 | 0.577 | 0.366 | 0.021 | 1.000 | 0.605 | 0.512 | 0.010 | 0.039 | 0.023 | -0.063 | 0.657 | 0.036 | 0.680 |
| community_board | 0.024 | 0.976 | 0.957 | 1.000 | 0.033 | 0.605 | 1.000 | 0.783 | 0.016 | 0.032 | 0.024 | -0.356 | 0.530 | 0.037 | 0.852 |
| council_district | 0.022 | 0.774 | 0.755 | 0.861 | 0.031 | 0.512 | 0.783 | 1.000 | 0.015 | 0.042 | 0.025 | -0.633 | 0.173 | 0.021 | 0.718 |
| critical_flag | 0.449 | 0.015 | 0.014 | 0.014 | 0.014 | 0.010 | 0.016 | 0.015 | 1.000 | 0.125 | 0.578 | 0.012 | 0.012 | 0.124 | 0.016 |
| grade | 0.422 | 0.030 | 0.033 | 0.033 | 0.123 | 0.039 | 0.032 | 0.042 | 0.125 | 1.000 | 0.477 | 0.018 | 0.018 | 0.500 | 0.026 |
| inspection_type | 0.548 | 0.023 | 0.027 | 0.033 | 0.232 | 0.023 | 0.024 | 0.025 | 0.578 | 0.477 | 1.000 | 0.024 | 0.024 | 0.066 | 0.026 |
| latitude | 0.009 | -0.300 | -0.360 | 0.047 | -0.012 | -0.063 | -0.356 | -0.633 | 0.012 | 0.018 | 0.024 | 1.000 | 0.301 | -0.001 | -0.328 |
| longitude | 0.009 | 0.488 | 0.497 | 0.047 | 0.009 | 0.657 | 0.530 | 0.173 | 0.012 | 0.018 | 0.024 | 0.301 | 1.000 | 0.033 | 0.637 |
| score | 0.291 | 0.035 | 0.031 | 0.041 | 0.138 | 0.036 | 0.037 | 0.021 | 0.124 | 0.500 | 0.066 | -0.001 | 0.033 | 1.000 | 0.045 |
| zipcode | 0.020 | 0.847 | 0.835 | 0.762 | 0.031 | 0.680 | 0.852 | 0.718 | 0.016 | 0.026 | 0.026 | -0.328 | 0.637 | 0.045 | 1.000 |
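To read the matrix above against the "High correlation" alerts, it can help to list only the pairs whose absolute coefficient exceeds a cutoff. The Python sketch below does this for a handful of entries copied from the matrix; the 0.5 cutoff is an assumption chosen for illustration, not a value stated in this report.

```python
# A few coefficients copied from the matrix above (not the full table).
corr = {
    ("bbl", "bin"): 0.966,
    ("latitude", "council_district"): -0.633,
    ("critical_flag", "inspection_type"): 0.578,
    ("action", "inspection_type"): 0.548,
    ("latitude", "longitude"): 0.301,
    ("action", "score"): 0.291,
}

THRESHOLD = 0.5  # assumed cutoff for "high" correlation

# Keep pairs above the cutoff, strongest first (by absolute value).
strong = sorted(
    ((pair, r) for pair, r in corr.items() if abs(r) > THRESHOLD),
    key=lambda item: -abs(item[1]),
)

for (left, right), r in strong:
    print(f"{left} ~ {right}: {r:+.3f}")
```

With this cutoff the four retained pairs are all among the alerts listed at the top of the report, while latitude and longitude (0.301) drop out.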
Missing values
Sample
| | camis | dba | boro | building | street | zipcode | phone | cuisine_description | inspection_date | action | violation_code | violation_description | critical_flag | score | record_date | inspection_type | latitude | longitude | community_board | council_district | census_tract | bin | bbl | nta | grade | grade_date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 50099353 | LA AURORA | Queens | 23917 | BRADDOCK AVE | 11426.0 | 7183474271 | Spanish | 2023-12-22T00:00:00.000 | Violations were cited in the following area(s). | 04L | Evidence of mice or live mice in establishment's food or non-food areas. | Critical | 44.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.726551 | -73.728606 | 413.0 | 23.0 | 162100.0 | 4167536.0 | 4.079870e+09 | QN43 | NaN | NaN |
| 1 | 50118899 | MAMA'S RESTAURANT | Queens | 3708 | 73RD ST | 11372.0 | 3478917275 | Bangladeshi | 2023-05-10T00:00:00.000 | Violations were cited in the following area(s). | 04A | Food Protection Certificate (FPC) not held by manager or supervisor of food operations. | Critical | 53.0 | 2024-11-18T06:00:10.000 | Pre-permit (Operational) / Re-inspection | 40.748445 | -73.892648 | 403.0 | 25.0 | 29100.0 | 4454471.0 | 4.012830e+09 | QN28 | C | 2023-05-10T00:00:00.000 |
| 2 | 50108824 | NY 99 CENTS FRESH PIZZA | Manhattan | 12 | PERRY STREET | 10014.0 | 9172922325 | Pizza | 2024-09-26T00:00:00.000 | Violations were cited in the following area(s). | 10G | Dishwashing and ware washing: Cleaning and sanitizing of tableware, including dishes, utensils, and equipment deficient. | Not Critical | 26.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.735825 | -74.001259 | 102.0 | 3.0 | 7100.0 | 1010884.0 | 1.006120e+09 | MN23 | NaN | NaN |
| 3 | 50132634 | AUTHENTIC FLAVAZ | Brooklyn | 1377 | EAST NEW YORK AVENUE | 11212.0 | 7189751121 | Caribbean | 2024-06-14T00:00:00.000 | Violations were cited in the following area(s). | 08A | Establishment is not free of harborage or conditions conducive to rodents, insects or other pests. | Not Critical | 34.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.669334 | -73.917623 | 316.0 | 41.0 | 36100.0 | 3039538.0 | 3.014740e+09 | BK79 | NaN | NaN |
| 4 | 50126098 | SUSHI TIME | Queens | 7242 | AUSTIN ST | 11375.0 | 3479560001 | Japanese | 2023-03-21T00:00:00.000 | Violations were cited in the following area(s). | 04L | Evidence of mice or live mice in establishment's food or non-food areas. | Critical | 9.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.718725 | -73.841296 | 406.0 | 29.0 | 73700.0 | 4077933.0 | 4.032558e+09 | QN17 | NaN | NaN |
| 5 | 50101506 | 65 KUHO | Brooklyn | 1701 | 65 STREET | 11204.0 | 7182325688 | Japanese | 2023-06-14T00:00:00.000 | Violations were cited in the following area(s). | 04K | Evidence of rats or live rats in establishment's food or non-food areas. | Critical | 29.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.620303 | -73.992119 | 311.0 | 43.0 | 25200.0 | 3133467.0 | 3.055460e+09 | BK28 | NaN | NaN |
| 6 | 50001769 | ZEN VEGETARIAN HOUSE | Brooklyn | 773 | FLATBUSH AVENUE | 11226.0 | 7182822255 | Vegetarian | 2022-01-13T00:00:00.000 | Violations were cited in the following area(s). | 02B | Hot food item not held at or above 140º F. | Critical | 28.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.653972 | -73.959520 | 314.0 | 40.0 | 79602.0 | 3116183.0 | 3.050640e+09 | BK60 | NaN | NaN |
| 7 | 50000071 | GEORGIA DINER | Queens | 80-26 | QUEENS BOULEVARD | 11373.0 | 7186519000 | American | 2024-04-18T00:00:00.000 | Violations were cited in the following area(s). | 02B | Hot TCS food item not held at or above 140 °F. | Critical | 32.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.737681 | -73.882258 | 404.0 | 30.0 | 47900.0 | 4057121.0 | 4.024720e+09 | QN50 | NaN | NaN |
| 8 | 41616179 | DARO'S PIZZA | Queens | 44-25 | KISSENA BOULEVARD | 11355.0 | 7184455573 | Pizza | 2022-04-14T00:00:00.000 | Violations were cited in the following area(s). | 08A | Facility not vermin proof. Harborage or conditions conducive to attracting vermin to the premises and/or allowing vermin to exist. | Not Critical | 42.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.753377 | -73.822170 | 407.0 | 20.0 | 85900.0 | 4117219.0 | 4.051920e+09 | QN22 | NaN | NaN |
| 9 | 41147318 | FEENEY'S PUB | Brooklyn | 6201 | 5 AVENUE | 11220.0 | 9177547597 | American | 2022-11-01T00:00:00.000 | Violations were cited in the following area(s). | 04L | Evidence of mice or live mice in establishment's food or non-food areas. | Critical | 27.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.638247 | -74.017432 | 307.0 | 38.0 | 12200.0 | 3144010.0 | 3.058010e+09 | BK32 | NaN | NaN |
| | camis | dba | boro | building | street | zipcode | phone | cuisine_description | inspection_date | action | violation_code | violation_description | critical_flag | score | record_date | inspection_type | latitude | longitude | community_board | council_district | census_tract | bin | bbl | nta | grade | grade_date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 239661 | 50133971 | ALL IN | Queens | 4508 | PARSONS BLVD | 11355.0 | 7187220992 | Asian/Asian Fusion | 2023-04-10T00:00:00.000 | Violations were cited in the following area(s). | 04K | Evidence of rats or live rats in establishment's food or non-food areas. | Critical | 24.0 | 2024-11-18T06:00:10.000 | Pre-permit (Operational) / Initial Inspection | 40.755483 | -73.815440 | 407.0 | 20.0 | 120100.0 | 4117536.0 | 4.052050e+09 | QN52 | NaN | NaN |
| 239662 | 50127731 | AVOCADO SUSHI | Staten Island | 4906 | ARTHUR KILL ROAD | 10309.0 | 3474051970 | Japanese | 2024-03-04T00:00:00.000 | Violations were cited in the following area(s). | 02B | Hot TCS food item not held at or above 140 °F. | Critical | 11.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.522714 | -74.239249 | 503.0 | 51.0 | 22600.0 | 5087205.0 | 5.075840e+09 | SI11 | A | 2024-03-04T00:00:00.000 |
| 239663 | 50105066 | MAYAS SNACK BAR | Brooklyn | 310 | SAINT NICHOLAS AVENUE | 11237.0 | 3478895529 | Frozen Desserts | 2023-04-24T00:00:00.000 | Violations were cited in the following area(s). | 20-04 | “Choking first aid” poster not posted. “Alcohol and Pregnancy” warning sign not posted. Resuscitation equipment: exhaled air resuscitation masks (adult & pediatric), latex gloves, sign not posted. | Not Critical | NaN | 2024-11-18T06:00:10.000 | Administrative Miscellaneous / Initial Inspection | 40.701099 | -73.910796 | 304.0 | 37.0 | 43900.0 | 3076401.0 | 3.033380e+09 | BK77 | NaN | NaN |
| 239664 | 50070527 | BLACK STONE COFFEE ROASTERS | Manhattan | 502 | HUDSON STREET | 10014.0 | 2129896131 | American | 2022-01-25T00:00:00.000 | Violations were cited in the following area(s). | 04J | Appropriately scaled metal stem-type thermometer or thermocouple not provided or used to evaluate temperatures of potentially hazardous foods during cooking, cooling, reheating and holding. | Critical | 0.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.733149 | -74.006379 | 102.0 | 3.0 | 7300.0 | 1011133.0 | 1.006190e+09 | MN23 | A | 2022-01-25T00:00:00.000 |
| 239665 | 50131039 | WING LUCK | Brooklyn | 252 | LIVONIA AVENUE | 11212.0 | 7183852100 | Chinese | 2024-07-25T00:00:00.000 | Violations were cited in the following area(s). | 02G | Cold TCS food item held above 41 °F; smoked or processed fish held above 38 °F; intact raw eggs held above 45 °F; or reduced oxygen packaged (ROP) TCS foods held above required temperatures except during active necessary preparation. | Critical | 33.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.662594 | -73.908545 | 316.0 | 41.0 | 91800.0 | 3082153.0 | 3.035900e+09 | BK81 | NaN | NaN |
| 239666 | 50106128 | AFTERNOON | Manhattan | 33 | WEST 32 STREET | 10001.0 | 3475426323 | Korean | 2022-02-22T00:00:00.000 | Violations were cited in the following area(s). | 10B | Plumbing not properly installed or maintained; anti-siphonage or backflow prevention device not provided where required; equipment or floor not properly drained; sewage disposal system in disrepair or not functioning properly. | Not Critical | 0.0 | 2024-11-18T06:00:10.000 | Pre-permit (Operational) / Initial Inspection | 40.747594 | -73.986531 | 105.0 | 4.0 | 7600.0 | 1015846.0 | 1.008340e+09 | MN17 | NaN | NaN |
| 239667 | 50061229 | BEBE FRITAY | Brooklyn | 1464 | ROCKAWAY PARKWAY | 11236.0 | 6462473242 | Caribbean | 2024-01-11T00:00:00.000 | Violations were cited in the following area(s). | 04L | Evidence of mice or live mice in establishment's food or non-food areas. | Critical | 28.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.644588 | -73.901727 | 318.0 | 46.0 | 96800.0 | 3229685.0 | 3.081840e+09 | BK50 | NaN | NaN |
| 239668 | 50102277 | KUNG FU TEA | Manhattan | 73 | CHRYSTIE STREET | 10002.0 | 9175828883 | Coffee/Tea | 2021-10-05T00:00:00.000 | Violations were cited in the following area(s). | 08C | Pesticide use not in accordance with label or applicable laws. Prohibited chemical used/stored. Open bait station used. | Not Critical | 7.0 | 2024-11-18T06:00:10.000 | Pre-permit (Operational) / Initial Inspection | 40.717147 | -73.994372 | 103.0 | 1.0 | 1600.0 | 1003945.0 | 1.003040e+09 | MN27 | A | 2021-10-05T00:00:00.000 |
| 239669 | 50057589 | TOUS LES JOURS | Queens | 3916 | PRINCE ST | 11354.0 | 7188881992 | Bakery Products/Desserts | 2022-01-27T00:00:00.000 | Violations were cited in the following area(s). | 08C | Pesticide use not in accordance with label or applicable laws. Prohibited chemical used/stored. Open bait station used. | Not Critical | 24.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.759478 | -73.832243 | 407.0 | 20.0 | 87100.0 | 4539244.0 | 4.049738e+09 | QN22 | B | 2022-01-27T00:00:00.000 |
| 239670 | 50105466 | LE PAIN QUOTIDIEN | Manhattan | 41 | W 40TH ST | 10018.0 | 6462767589 | Other | 2024-06-25T00:00:00.000 | Violations were cited in the following area(s). | 08A | Establishment is not free of harborage or conditions conducive to rodents, insects or other pests. | Not Critical | 31.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.753069 | -73.983881 | 105.0 | 4.0 | 8400.0 | NaN | 1.000000e+00 | MN17 | NaN | NaN |
Duplicate rows
Most frequently occurring
| | camis | dba | boro | building | street | zipcode | phone | cuisine_description | inspection_date | action | violation_code | violation_description | critical_flag | score | record_date | inspection_type | latitude | longitude | community_board | council_district | census_tract | bin | bbl | nta | grade | grade_date | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 40365904 | MEE SUM CAFE | Manhattan | 26 | PELL STREET | 10013.0 | 2123495260 | Coffee/Tea | 2022-09-30T00:00:00.000 | Violations were cited in the following area(s). | 02I | TCS food removed from cold holding or prepared from or combined with ingredients at room temperature not cooled by an approved method to 41 °F or below within 4 hours. | Critical | 59.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.714861 | -73.998200 | 103.0 | 1.0 | 2900.0 | 1001782.0 | 1.001630e+09 | MN27 | C | 2022-09-30T00:00:00.000 | 2 |
| 1 | 40369016 | VIAND COFFEE SHOP | Manhattan | 673 | MADISON AVENUE | 10065.0 | 2127516622 | Greek | 2022-09-19T00:00:00.000 | Violations were cited in the following area(s). | 06C | Food, supplies, and equipment not protected from potential source of contamination during storage, preparation, transportation, display or service. | Critical | 8.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.764924 | -73.970416 | 108.0 | 4.0 | 11401.0 | 1040850.0 | 1.013760e+09 | MN40 | A | 2022-09-19T00:00:00.000 | 2 |
| 2 | 40369016 | VIAND COFFEE SHOP | Manhattan | 673 | MADISON AVENUE | 10065.0 | 2127516622 | Greek | 2023-04-18T00:00:00.000 | Violations were cited in the following area(s). | 04L | Evidence of mice or live mice in establishment's food or non-food areas. | Critical | 12.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.764924 | -73.970416 | 108.0 | 4.0 | 11401.0 | 1040850.0 | 1.013760e+09 | MN40 | A | 2023-04-18T00:00:00.000 | 2 |
| 3 | 40369521 | KNICKERBOCKER BAR & GRILL | Manhattan | 33 | UNIVERSITY PLACE | 10003.0 | 2122288490 | American | 2023-11-14T00:00:00.000 | Violations were cited in the following area(s). | 02B | Hot TCS food item not held at or above 140 °F. | Critical | 11.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.731969 | -73.994451 | 102.0 | 2.0 | 5900.0 | 1009090.0 | 1.005608e+09 | MN23 | A | 2023-11-14T00:00:00.000 | 2 |
| 4 | 40372445 | OMONIA CAFE | Queens | 32-20 | BROADWAY | 11106.0 | 7182746650 | Greek | 2024-08-26T00:00:00.000 | Violations were cited in the following area(s). | 04M | Live roaches in facility's food or non-food area. | Critical | 27.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Re-inspection | 40.761477 | -73.924360 | 401.0 | 22.0 | 5900.0 | 4008332.0 | 4.006120e+09 | QN70 | Z | 2024-08-26T00:00:00.000 | 2 |
| 5 | 40378035 | DUNKIN', BASKIN ROBBINS | Queens | 15367 | HORACE HARDING EXPRESSWAY | 11367.0 | 7183584031 | Donuts | 2022-08-22T00:00:00.000 | Violations were cited in the following area(s). | 08A | Establishment is not free of harborage or conditions conducive to rodents, insects or other pests. | Not Critical | 22.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.739396 | -73.815741 | 407.0 | 20.0 | 83700.0 | 4141524.0 | 4.064410e+09 | QN62 | NaN | NaN | 2 |
| 6 | 40378212 | CUCHIFRITO | Manhattan | 168 | EAST 116 STREET | 10029.0 | 2128764846 | Spanish | 2024-06-03T00:00:00.000 | Violations were cited in the following area(s). | 20-06 | Current letter grade or Grade Pending card not posted | Not Critical | NaN | 2024-11-18T06:00:10.000 | Administrative Miscellaneous / Initial Inspection | 40.798286 | -73.940864 | 111.0 | 8.0 | 18200.0 | 1052256.0 | 1.016430e+09 | MN34 | NaN | NaN | 2 |
| 7 | 40379580 | COOPER TOWN DINER | Manhattan | 339 | 1 AVENUE | 10003.0 | 2126779287 | American | 2022-08-09T00:00:00.000 | Violations were cited in the following area(s). | 10B | Anti-siphonage or back-flow prevention device not provided where required; equipment or floor not properly drained; sewage disposal system in disrepair or not functioning properly. Condensation or liquid waste improperly disposed of. | Not Critical | 12.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.734753 | -73.979948 | 106.0 | 2.0 | 6400.0 | 1020526.0 | 1.009250e+09 | MN21 | A | 2022-08-09T00:00:00.000 | 2 |
| 8 | 40380628 | PISA PIZZERIA | Queens | 6568 | MYRTLE AVENUE | 11385.0 | 7183816368 | Pizza | 2023-03-22T00:00:00.000 | Violations were cited in the following area(s). | 02B | Hot TCS food item not held at or above 140 °F. | Critical | 12.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.701313 | -73.888255 | 405.0 | 30.0 | 62900.0 | 4090148.0 | 4.036980e+09 | QN19 | A | 2023-03-22T00:00:00.000 | 2 |
| 9 | 40380826 | FIT CAFETERIA (BUILDING A ) | Manhattan | 227 | WEST 27 STREET | 10001.0 | 2122175770 | American | 2022-02-14T00:00:00.000 | Violations were cited in the following area(s). | 09C | Food contact surface not properly maintained. | Not Critical | 8.0 | 2024-11-18T06:00:10.000 | Cycle Inspection / Initial Inspection | 40.747120 | -73.994991 | 105.0 | 3.0 | 9500.0 | 1014251.0 | 1.007770e+09 | MN17 | A | 2022-02-14T00:00:00.000 | 2 |
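
A table like the one above can be approximated by counting identical rows and keeping those that occur more than once. The following is a minimal sketch, not the profiling tool's own implementation. It assumes that `# duplicates` is the number of times the identical row occurs and that two missing cells (NaN) count as equal. The names `normalize` and `most_frequent_duplicates` are hypothetical, and the sample rows use only four of the 26 columns for brevity.

```python
import math
from collections import Counter


def normalize(value):
    """Map NaN to None so that two missing cells compare equal (assumption)."""
    if isinstance(value, float) and math.isnan(value):
        return None
    return value


def most_frequent_duplicates(rows, top=10):
    """Return up to `top` (row, count) pairs for rows that occur more than once."""
    # Count every fully identical row after normalizing missing values.
    counts = Counter(tuple(normalize(v) for v in row) for row in rows)
    # Keep only rows seen at least twice, most frequent first.
    dupes = [(row, n) for row, n in counts.items() if n > 1]
    dupes.sort(key=lambda item: item[1], reverse=True)
    return dupes[:top]


# Hypothetical subset of columns: camis, dba, violation_code, score.
sample = [
    (40365904, "MEE SUM CAFE", "02I", 59.0),
    (40365904, "MEE SUM CAFE", "02I", 59.0),
    (40378212, "CUCHIFRITO", "20-06", float("nan")),
    (40378212, "CUCHIFRITO", "20-06", float("nan")),
    (50105466, "LE PAIN QUOTIDIEN", "08A", 31.0),
]

result = most_frequent_duplicates(sample)
for row, n in result:
    print(row, n)
```

Normalizing NaN matters because a row with a missing `score`, such as the CUCHIFRITO entry, would otherwise never match its own copy.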